BTS: Resource capacity estimate for time-targeted science workflows
نویسندگان
چکیده
Workflow technologies have become a major vehicle for easy and efficient development of scientific applications. A critical challenge in integrating workflow technologies with state-of-the-art resource provisioning technologies is to determine the right amount of resources required for the execution of workflows. This paper introduces an approximation algorithm named BTS (Balanced Time Scheduling), which estimates the minimum number of computing hosts required to execute workflows within a user-specified finish time. The experimental results, based on a number of synthetic workflows and several real science workflows, demonstrate that the BTS estimate of resource capacity approaches to the theoretical lower bound. The BTS algorithm is scalable and its turnaround time is only tens of seconds, even with huge workflows with thousands of tasks and edges. Moreover, BTS achieves good performance with workflows having MPI-like parallel tasks. Finally, BTS can be easily integrated with any resource description languages and resource provisioning systems since the resource estimate of BTS is abstract. © 2011 Elsevier Inc. All rights reserved.
منابع مشابه
Efficient Resource Capacity Estimate of Workflow Applications for Provisioning Resources
Workflow technologies have become a major vehicle for the easy and efficient development of science applications. When integrating the workflow technology with the state-of-art resource provisioning technology, the challenge is to determine the amount of resources necessary for the execution of workflow. This paper introduces an algorithm named Balanced Time Scheduling (BTS), which estimates th...
متن کاملCost optimized provisioning of elastic resources for application workflows
Workflow technologies have become a major vehicle for easy and efficient development of scientific applications. In the meantime, state-of-the-art resource provisioning technologies such as cloud computing enable users to acquire computing resources dynamically and elastically. A critical challenge in integrating workflow technologies with resource provisioning technologies is to determine the ...
متن کاملA Data Management Framework for Urgent Geoscience Workflows
The emerging class of urgent geoscience workflows are capable of quickly allocating computational resources for time critical tasks. To date, no urgent computing capabilities for data services exists. Since urgent geoscience and Earth science workflows are typically data intensive, urgent data services are necessary so that these urgent workflows do not bottleneck on inappropriately managed or ...
متن کاملCrossroads - Spring 2010
14 Spring 2010/ Vol. 16, No. 3 www.acm.org/crossroads Crossroads Besides public data repositories, scientific collaborations maintain community-wide data resources. For example, in gravitational-wave physics, the Laser Interferometer Gravitational-Wave Observatory [3] maintains geographically distributed repositories holding time-series data collected by the instruments and their associated met...
متن کاملA Multi-objective optimization model for project scheduling with time-varying resource requirements and capacities
Proper and realistic scheduling is an important factor of success for every project. In reality, project scheduling often involves several objectives that must be realized simultaneously, and faces numerous uncertainties that may undermine the integrity of the devised schedule. Thus, the manner of dealing with such uncertainties is of particular importance for effective planning. A realistic sc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Parallel Distrib. Comput.
دوره 71 شماره
صفحات -
تاریخ انتشار 2011